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O ■ Abstract: 



We use recent neutrino dimuon production data combined with a global deep-inelastic 
■ parton fit to construct a new parton set, NNPDF 1.2, which includes a determination of 

^ - the strange and antistrange distributions of the nucleon. The result is characterized by a 

faithful estimation of uncertainties thanks to the use of the NNPDF methodology, and is 
free of model or theoretical assumptions other than the use of NLO perturbative QCD and 
exact sum rules. Better control of the uncertainties of the strange and antistrange par- 
ton distributions allows us to reassess the determination of electroweak parameters from 
the NuTeV dimuon data. We perform a direct determination of the \Vcd\ and \Vcs\ CKM 
matrix elements, obtaining central values in agreement with the current global CKM fit: 
specifically we find \Vcd\ = 0.244 ± 0.019 and \Vcs\ = 0.96 ± 0.07. Our result for |Ks| is 
more precise than any previous direct determination. We also reassess the uncertainty 
on the NuTeV determination of sin^ Oy/ through the Paschos-Wolfenstein relation: we 
find that the very large uncertainties in the strange valence momentum fraction are suffi- 
cient to bring the NuTeV result into complete agreement with the results from precision 
electroweak data. 
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1 The strange content of the nucleon 



The determination of the strange and antistrange quark distributions of the nucleon is of 
considerable phenomenological interest, because many final states in the standard model 
and beyond couple directly to strangeness. A notable example is the determination of the 
elcctrowcak mixing angle by the NuTcV collaboration [1], which might provide evidence 
for physics beyond the standard model, and which is very sensitive [2] to the strange 
content of the nucleon. 

Unfortunately, the bulk of the data which are used for parton determination, namely 
neutral-current deep-inelastic scattering, have minimal sensitivity to flavour separation, 
and no sensitivity at all to the separation of quark and antiquark contributions. As a conse- 
quence, until very recently in standard parton fits such as CTEQ6.5 [3] and MRST2006 [4], 
the strange and antistrange quark distributions were not determined directly: rather, they 
were assumed to be equal, and then proportional to the total light antiquark sea distribu- 
tion. The only available attempt at a determination of the strange and antistrange dis- 
tributions [5] was based on a re-analysis of old (mostly bubble-chamber) charged-current 
neutrino-nucleon scattering data: unfortunately, the quality of these old data was insuffi- 
cient for a reliable determination. 

This situation has changed recently, due to the availability of a wider set of inclusive 
neutrino deep-inelastic scattering data [6, 7] and, more importantly, of data for deep- 
inelastic neutrino and anti- neutrino production of charm [8-10] ("dimuon" data, hence- 
forth), which is directly sensitive to the strange and antistrange parton distributions. 
As a consequence, dedicated analyses of the strange quark distribution have been per- 
formed [11-14], and independent parametrizations of the strange and antistrange distri- 
butions are included in most recent parton fits [15]. However, the standard method of 
parton determination used in all these references, which is based on fitting the param- 
eters of a fixed functional form, is known to be hard to handle when the experiments 
are relatively unconstraining. Indeed, it is not uncommon that the addition of new ex- 
perimental information to a parton fit of this kind, actually leads to an increase rather 
than a decrease of uncertainty bands (see e.g. [16]), because the new data require the 
use of a more general parametrization. This hampers a direct statistical interpretation 
of the uncertainty bands on parton distributions obtained in this way: indeed, in some 
of these parton determinations [15, 17] experimental uncertainties are infiated by suitable 
"tolerance" criteria. Precision measurements are thus very difficult to obtain whenever 
the results are significantly affected by parton uncertainties. This is clearly the case in the 
extraction of the electroweak mixing angle from the NuTeV data of Ref. [1] , and it could 
be more generally an issue for LHC observables which depend crucially on the strange 
distribution, such as the "standard candle" (Tz/(yw [17]. 

A method of parton determination which is free of these difficulties was developed 
by us in a series of papers [18-20], and has led recently to the construction of a full 
parton set based on a fit to a global set of deep-inelastic scattering data: NNPDFl.O [21]. 
This method is based on the use of neural networks for parton parametrization, and a 
Monte Carlo method supplemented by a suitable training and stopping algorithm for the 
construction of the parton fit. In this approach, parton distributions are given as a Monte 
Carlo sample representing their probability distributions as inferred from the data: so, for 
instance, uncertainties can be obtained from the sample by computing standard deviations. 
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likelihood intervals by determining frequency histograms, and so on. 

It was shown that this methodology is largely free of bias related to parton parametriza- 
tion, and it handles in a satisfactory way incomplete information, contradictory data, and 
the addition of new data within a single framework. In particular, in Ref. [21] it was explic- 
itly verified that when data are removed by changing the kinematic cuts, the uncertainty 
bands widen in such a way that results before and after the cuts remain compatible, while 
results outside the data region directly affected by the cuts remain stable. In Ref. [22] 
it was further checked that the same behaviour is observed when the whole dataset is 
altered, e.g. by removing all data from one or more experiments: a fit to a smaller dataset 
has wider uncertainties, but remains compatible with the fit to the larger dataset. 

That these stability properties of the NNPDF approach apply also to the way the 
strange distribution is treated was shown in a dedicated study based on the same method- 
ology [23], leading to the NNPDFl.l parton set. In NNPDFl.l, the strange parton distri- 
butions = Sits are parametrized by two independent neural networks, instead of being 
taken to be proportional to the light antiquark distribution as in NNPDFl.O. However, 
the dataset is the same as for NNPDFl.O: so the s"*" distribution is only very weakly 
constrained, and the s~ essentially unconstrained by the the data. Nevertheless, when 
results of this pair of fits are compared, they show remarkable stability, despite the fact 
that each neural network is parametrized by a very redundant set of parameters (the ad- 
dition of two neural nets results in the addition of 74 extra free parameters in the fit). 
Indeed, parton distributions which are unaffected by the addition of independent strange 
degrees of freedom (such as the gluon) are unchanged, and the only marked effect of the 
independent parametrization of strangeness is an increase, by about a factor two, of the 
uncertainty on the total valence quark distribution {u — u + d — d + s — s). Remarkably, 
statistical analysis of the NNPDFl.O set alone was already sufficient to show [21] that the 
uncertainty on this combination was underestimated. 

In this paper, by adding recent dimuon data to the global deep-inelastic scattering 
dataset on which the NNPDFl.O and NNPDFl.l fits were based, we construct a new 
parton set, NNPDF1.2, which includes a determination of the strange and antistrange 
distributions. Furthermore, we determine directly the \Vcs\ and \Vcd\ CKM matrix ele- 
ments which control the strength of the charged-current coupling to neutrinos in dimuon 
production of the strange and down quarks respectively, and we use our determination of 
the strange quark distribution to compute the correction to the Paschos-Wolfenstein ratio 
to be used in extractions of the electroweak mixing angle. 

We find that the shape of the strange and antistrange distributions which are com- 
patible with data are rather more general than those obtained in other recent stud- 
ies [11-15,17]. Our uncertainty on the ratio Ks = [-5+]/ [U + Dj of strange to light 
sea momenta is rather more asymmetric than hitherto assumed: Ks [Q^ = 20GeV^) = 
0.71^Q3^'^*'^*. This may have nontrivial implications for LHC observables, such as the Z/W 
cross section ratio mentioned above. Despite these increased uncertainties, we find that, 
perhaps surprisingly, the dimuon data are sufficient to determine \Vcs\ = 0.96 ± 0.07*°*. 
This is one order of magnitude more precise than any other direct determination from 
neutrino deep-inelastic scattering, and is comparable to the current PDG best average of 
direct determinations from D meson decays, {\Vcs\ = 1.04 ± 0.06 [24]), though still two 
orders of magnitude worse than the results of a global CKM fit. The related CKM ele- 
ment \Vcd\ is also determined, \Vcd\ = 0.244 it 0.019*°*, with a similar accuracy to other 
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determinations from dimuon data. 

We further find that the s — s distribution, which must change sign as a function of x in 
order for the total nucleon strangeness to vanish, can do so in a wide variety of ways, and 
that its sign at any given x is not weU determined. As a consequence, the uncertainty in the 
strange valence momentum fraction, and thus in the correction to the Paschos-Wolfenstein 
ratio, is much larger than hitherto assumed, and is such that the NuTeV measurement of 
sin^ 6w is actually in complete agreement with determinations from precision electroweak 
data once this uncertainty is taken into account. 

Many of the techniques and tools that we use in this paper are part of the standard 
NNPDF methodology, already described in detail in Refs. [19-21] and used there for the 
construction of the NNPDF 1.0 parton set. Here we will focus on the new aspects of 
the NNPDF1.2 set, and then discuss our main results. Thus in Sect. [2] we describe the 
dimuon cross section and its available experimental determinations, and in Sect. [3] we will 
give its expression in perturbative QCD and thus its relation to the strange distribution, 
and discuss the way the strange and antistrange distributions are treated, as well as 
some specific theoretical issues related to the treatment of this observable, such as the 
treatment of the charm mass and of nuclear corrections. Full details of the hard kernels 
used to construct the physical observables are given in Appendix O In Section [4] we 
present our determination of the strange and antistrange distributions, specifically their 
shape and their contribution to the nucleon momentum, and compare them to results 
obtained by other groups. In Section [5] we will discuss in detail the implications of our 
results for precision electroweak measurements, and discuss specifically the determination 
of the CKM matrix elements \Vcs\ and \Vcd\ and the impact of our results on the NuTeV 
determination of the electroweak mixing angle. 
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Figure 1: Experimental data in the {x, Q^) plane used in the NNPDF1.2 analysis after kinematic 
cuts. 



2 Experimental data 

The NNPDF1.2 parton determination is based on the same data set used for NNPDFl.O, 
supplemented by data on deep-inelastic neutrino production of charm from NuTeV [8, 25] 
which give us a handle on the strange distribution, whose determination is the main goal of 
this paper. We also add to the data set some recently published measurements of neutral 
current and charged current deep-inelastic cross sections by the ZEUS experiment based 
on HERA-II data [26,27]. 

An earlier measurement of the dimuon cross section using the same detector (but 
a different beam-line) was performed by the CCFR collaboration [28]. This previous 
measurement is significantly less accurate and its compatibility with the NuTeV data is 
debatable [8,14]; we will not include it in our fit. A recent measurement of the dimuon 
cross section has also been performed by the CHORUS collaboration [9]; unfortunately, 
however, only the results of a leading-order QCD analysis of this data have been published, 
and not the cross-section data themselves, which therefore cannot be used in our analysis. 

The treatment of experimental data in the present fit follows Ref. [21]. In particular, all 
information on correlated systematics is included in our fit, in that the full covariance ma- 
trix is computed including all available correlated uncertainties (including normalization 
uncertainties) . 

Below we give more explicit details of the new data and corresponding observables 
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ZEUS06NC 


90 (90) 


.005 


0.65 


200.0 


30000.0 


2.6 




[26] 


ZEUS06CC 


37 (37) 


.015 


0.65 


280.0 


30000.0 


14 




[27] 


NUTEV Dimuon 


















NuTcV Dimuon v 


45 (43) 


.0267 


0.37 


1.1 


116.5 


19 




[25] 


NuTcV Dimuon v 


45 (41) 


.021 


0.25 


0.8 


68.3 


23 




[25] 


Total (including Tab. 1 of Ref. [21J) 


4165 (3372) 





Table 1: Further experimental data included in the present analysis in addition to those given in 
Table 1 of Ref. [21]. We show the number of points before (after) applying kinematic cuts, the 
kinematic range, the average total uncertainty after cuts and the observable which is measured. 
Different sets within an experiment are correlated with each other, while data from different ex- 
periments are uncorrelated. The total number of data points refers to the full dataset. 

which have been included in the current fit. 

2.1 Data set, uncertainties and correlations 

The data set used for the present fit is obtained by supplementing the data set used for the 
NNPDFl.O fit, as summarized in Table 1 of Ref. [21], with the data summarized in Tabled] 
given here. A scatter plot of the full data set is displayed in Figure [H Note that NuTeV 
dimuon data overlap with the rest of fixed target experiments, providing information on 
the proton strangeness for x > 10^^. 

The covariance matrix is computed for all the data included in the fit, as discussed in 
Ref. [21]. The NuTeV dimuon data are affected by a common normalization uncertainty 
of 2.1% [6]; eight correlated systematics; and a statistical uncertainty. The statistical 
uncertainty is around 15% for neutrino and around 25% for anti-neutrino data, while 
correlated systematics are generally smaller by a factor between three and five. This 
dominant statistical uncertainty is affected by a bin by bin correlation due to the unfolding 
procedure used in extracting the dimuon cross section from the measured observable. The 
covariance matrix which describes these correlations is not available. Its effect has been 
summarized in Ref. [25] by providing for each bin an "effective number of degrees of 
freedom", which provides the expected value of the best-fit to the given data bin, i.e., 
effectively, a rescaling for the statistical error. These rescaling factors can be as low as 
30%, and are typically around 50%, indicating sizable correlations. 

Rescaling of statistical errors in order to account for missing correlations could bias 
the fit in an unpredictable way and it is a dangerous procedure if the information on the 
covariance matrix is lost. On the other hand, only including correlations for the sub- 
dominant systematic errors could lead to an underestimate of the relative impact of these 
uncertainties. Hence, because the covariance matrix of the NuTeV data is unfortunately 
unavailable, the only consistent procedure for the treatment of these data is to add all 
uncertainties in quadrature, and only consider normalizations as correlated uncertainties. 
This is the procedure that we shall follow. 

2.2 Observables, kinematic cuts and pseudo-data sample 

The set of observables considered in these fits consists of the structure functions and 
reduced cross-sections considered in Ref. [21] and summarized in Table 1 of that reference. 
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Experiment 


ZEUS-HERA-II 


NuToV Dimuon 


Total 


r [Ft"')] 


-4.2 -10"* 
0.999 


-5.4 -10-* 
0.999 


-2.3 -lO"* 
0.999 


/pB|/^(art)\ |\ 

/^(oxp)\ 

\ /dat 

/<,(art)\ 


6.5 -lO^"^ 
13.79% 
13.88% 

0.999 


-2.6 -10^'^ 
21.23% 
21.17% 
0.998 


-6.1 -10^^ 
11.24% 
11.24% 
0.999 


\ /dat 


0.287 
0.294 
0.994 


0.034 
0.034 
0.978 


0.146 
0.146 
0.996 


/cov(<=''P) \ 

\ /dat 

(cov<"')) 
r cov("'' 1 


6.89 -10-* 
7.03 -10-* 
0.997 


0.169 
0.168 
0.988 


1.61 -10"^ 
1.64 -10"^ 
0.988 



Table 2: Statistical estimators for the Monte Carlo artificial data generation with A^rep = 1000, 
for experiments not included in Ref. [21]. The definition of the statistical estimators is given in 
Appendix B of [20] . The faithfulness of the Monte Carlo sampling of experimental data is assessed 
quantitatively by these estimators. 



supplemented by the dimuon cross section. Neutrino dimuon production is induced by 
charm production through charged current interactions of neutrinos with the target nuclei, 
followed by the fragmentation of the charm quark into a charmed hadron and its decay 
into a muon. The corresponding cross section is given by 



1 (i^o-^^^)''^ 



dx dy 



G^Mn 



27r(i + qyu^ 



2 \2 



m. 



1 + 7^ +y 



Q2 V V 



where 



Q2 = 2MNE^xy, 



F± = l±(l-y)2. 



(1) 



(2) 



The charm production cross section is obtained from the published NuTeV neutrino 
dimuon production cross sections [25] as 



1 dV'^^^)'^ 
El, dx dy 



{x,y,Q^) 



1 d2o-'^(^)'2/^ 



(Br {D ^ ij.)) - A (x, y, E^) E^ dx dy 



{x,y,Q^), (3) 



where (Br (D /i)) is the average branching ratio of charmed hadrons into muons and 
A {x, y, E^) is a bin-dependent experimental acceptance correction. Acceptances are pro- 
vided by the NuTeV collaboration, based on a leading-order model [29]; next-to-leading 
order acceptances [30] (not publicly available) differ by less than 3% from the leading- 
order ones. The branching ratio used in the NuTeV analysis [10] comes from a reanalysis 
of the emulsion data of the FNAL E531 experiment and turns out to be (Br (D — > /x)) = 
0.099±0.012, in agreement with other determinations [9,31]. A simultaneous extraction of 
this parameter along with the determination of strangeness in Ref. [14] leads to a similar 
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result. In the determination of the dimuon cross section, the branching ratio will be set 
equal to the central value used in the NuTeV analysis [10]. The associated uncertainty 
will then be included in our fit as discussed in Section 13.21 below. 

Our data set is obtained by imposing on all the data listed in Table 1 of Ref. [21] and 
in Table [U the same kinematical cuts as in NNPDFl.O, namely > Q^^^ = 2 GeV^ and 
W'^ > 12.5 GeV^. After these cuts, 84 out of the 90 NuTeV dimuon data points are left. 
After cuts, the total number of data points in the NNPDF1.2 analysis is N^at = 3372. 

Error propagation from the experimental data to the fit is performed through a Monte 
Carlo procedure, described in detail in Ref. [21], by generating a set of 1000 pseudo-data 
replicas, whose faithfulness can be verified by studying suitable statistical estimators. The 
statistical estimators for the new data sets included in the present fit, as well as for the 
global data set, are summarized in Table [2j 
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3 Neural networks, parton distributions and physical ob- 
servables 



Physical observables are determined from a set of PDFs given at a reference scale, which 
are in turn parametrized in terms of neural networks, according to the formalism discussed 
in detail in Sect. 3-4 of Ref. [21]. Here we summarize the new features of this determination: 
the use of an independent parametrization for the strange and antistrange distribution and 
its construction in terms of neural networks, the new physical observables used for dimuon 
data, and some issues that require reconsidering when dealing with this observable, namely 
the treatment of the charm mass and nuclear corrections. 

3.1 Parametrization of the strange PDF 

In the NNPDFl.O fit of Ref. [21], parton distributions were parametrized using five inde- 
pendent neural networks: four independent linear combinations of the two light flavours 
and anti-flavours, and the gluon. The strange and antistrange quark distributions were as- 
sumed to be given hy s = s = k{u + (1) /2 with k = 0.5, and heavy quarks were generated 
dynamically, using a zero-mass variable flavour number scheme (ZM-VFN). In the subse- 
quent NNPDFl.l fit [23], two further neural networks were introduced to parametrize the 
strange and antistrange quark distributions. Here, as in Ref. [23] we parametrize parton 
distributions in terms of seven independent neural networks, as we now discuss. 

The primary partonic quantities out of which all physical observables are built up are 
the gluon, the singlet quark distribution, the total valence quark distribution, and ten 
nonsinglet combinations of the valence {qi — g^) or total (g^ -|- g,) quark and antiquark 
distribution for the i-th quark flavor. These are constructed as in Ref. [21], to which 
we refer for more details. The starting scale is chosen at the charm threshold, where the 
charm distributions are assumed to vanish, and the remaining six light quark distributions 
and the gluon distribution are parametrized in terms of independent neural networks. The 
possibility of introducing an intrinsic charm distribution will not be studied in the present 
fit, though there is no obstacle to including it in future studies. 

The four light non-strange distributions and the gluon distribution are parametrized 
in terms of neural networks as in Ref. [21], by letting 

fix, Ql) = Af{l- x-^fNNf{x), (4) 

where /{xjQq) is a linear combination of parton distributions, and NNj(2;) is a multi- 
layer feed-forward neural network with two intermediate layers and architecture 2-5-3-1, 
parametrized by 37 free parameters (weights and thresholds) . The constants A f are either 
simply set to one, or else used to enforce the valence and momentum sum rules. 

The preprocessing function (1 — x)'^^ x~^f is included in order to speed up the conver- 
gence of the fit: the neural network only has to fit the deviation from the behaviour of the 
preprocessing function, whose exponents are thus fixed to values which absorb some of the 
gross behaviour of the function /(x, Qq) without biasing the result (i.e. without imposing 
a steep growth or fall which '^^N{x) would have trouble in reabsorbing). Independence of 
the results on the choice of the preprocessing exponents was verified in Ref. [21] by varying 
them within a reasonable stability range. This stability range is identified in Ref. [21] by 
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PDF 


m 


n 




[2.7,3.3] 


[1.1,1.3] 




[3.7,4.3] 


[1.1,1.3] 




[2.7,3.3] 


[0.1,0.4] 




[2.7,3.3] 


[0.1,0.4] 


^s{x.Ql) 


[2.7,3.3] 


[0,0.01] 


s+{x,Ql) 


[2.7,3.3] 


[1.1,1.3] 


s~{x,Ql) 


[2.7,3.3] 


[0.1,0.4] 



Table 3: The range of variation of the randomized preprocessing exponents used in the present 
NNPDF1.2 fit. 



requiring the quality of the fit to be unchanged as the exponents are varied. A small resid- 
ual dependence on the preprocessing exponents was found in Ref. [21] for the triplet and 
total valence quark distributions. In order to be able to disentangle accurately the strange 
contribution it is important that uncertainties on all light quark flavours are estimated as 
precisely as possible: for this purpose, in the NNPDFl.l fit of Ref. [23] and in the present 
fit all preprocessing exponents are randomized: a different value is taken for each Monte 
Carlo replica, uniformly distributed within the stability range. 

The choice of linear combinations of the two lightest flavours which are parametrized 
independently according to Eq. dH) is the same in the present fit as in NNPDFl.O. On 
top of them, we add two independent neural networks in the strange sector, in order to 
parametrize 

s^{x,Q'^) = s{x,Q'^)±s{x,Q^) (5) 

according to 

s+{x,Ql) = (l-a;)™=+x-"»+NN,+ (x) , (6) 
s-{x,Ql) = (l-x)™-x-"-NN,-(x)-Saux(x,Qg), (7) 

saux(2;, Ql) = A,^ [x'^- (1 - x)*- ] . (8) 

The exponents m, n of the preprocessing functions are randomized as discussed above, 
and their ranges are also listed in Table [3l 

The contribution Saux(2;,Qo) -^1- © introduced in order to enforce the strange 
valence sum rule: the constant Ag- is fixed by requiring 

I dxs'ix) = 0, (9) 
Jo 

which gives the condition 



where 



As- = , r (1 - x)™=- X-"- NN,- {x)dx. (10) 







Clearly, the sum rules requires s~ to change sign at least once. This way of implementing 
the sum rule is designed in order to ensure that this crossing happens naturally in the 
valence region, rather than in some contrived way outside the data region where the shape 
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of s~ is completely unconstrained. To this purpose, the exponents r^- , tg- are chosen in 
such a way that Sa_ux{x,Qo) peaks in the valence region, and that the small x and large 
X behaviour of s~{x,Q'^) are not controlled by the s^uxi^jQ^) contribution. In practice 
the latter condition is enforced by requiring r^- > —rig- and tg~ > rrig- , while the former 
is enforced by letting r^- = tg-/k, which sets the maximum of Saux(3;,Qo) at x = 
We then choose tg- = 3.5, and take A; as a uniformly distributed random number in the 
range k G [1,3]. The consequences of this very flexible implementation of the strangeness 
valence sum rule will be discussed in Sect. 14.41 below. 

3.2 The dimuon physical observable 

The NNPDF1.2 data set, displayed in Fig.[Tl contains data for the same set of observables 
discussed in Ref. [21], with the addition of the dimuon cross section Eq. ([l]). The latter 
is determined by the charm structure functions p^^'^^'^ ^p'^^'^^'^ and xFg which in the 
quark model are given by 

F^'P'^ix, Q2) = xF^^P^'ix, g2) = 2x {\V,df d{x) + 1^^.!' s{x) + \V,bf b{x)), (11) 
F^^P'^ix, Q2) = -xF^'P'^^ix, Q2) = 2x d{x) + \V,g\^ s{x) + \V,b\^ b{x)) , (12) 

with f'^^^'^ = 0. Full expressions for these structure functions in perturbative QCD at 
any scale in terms of the basis of PDFs used in our fits are given in Appendix |Al 

Because they are not inclusive with respect to the final state quark flavour, these 
structure functions depend on CKM matrix elements. These are extremely well determined 
by current global fits including unitarity constraints; for our global fits we will use the 
current best-fit PDG [24] values: uncertainties on them are tiny and will be neglected. In 
Section H] we will then study the quality of our fit as the parameters | Vcs \ and | Vcs \ are 
varied without the unitarity constraint, and use this to provide a direct determination of 
these parameters from the dimuon data. 

Also, as already discussed in Sect. [21 the dimuon cross-section Eq. ^ depends on the 
branching ratio (Br (L> ^)). The uncertainty in this is actually rather significant: in 
previous analyses [10,14] of dimuon data this turned out to be one of the dominant sources 
of uncertainty. To take account of this uncertainty, the value of the branching ratio used 
in the fit has been randomized about its central value, analogously to the procedure used 
for the preprocessing exponents, with a Gaussian distribution of width equal to the stated 
uncertainty (Br {D //)) = 0.099 ± 0.012 [10]. 

3.3 Treatment of the charm mass 

In the previous NNPDFl.O and NNPDFl.l parton determinations, heavy quarks were 
treated in a zero mass variable flavour number (ZM-VFN) scheme, as discussed in Sect. 3.4 
of Ref. [21]. Contributions suppressed by powers of the heavy quark mass, i.e. of order 
m|/Q^, which are neglected in this scheme, have a small but not entirely negligible im- 
pact [32], mostly through the initial condition on the charm distribution, which then affects 
all other PDFs due to the momentum sum rule. For the purpose of the present analysis, 
an improved treatment of the charm mass is advisable at least for dimuon production, as 
the dimuon cross section measures charm production, and a sizable fraction of the NuTeV 
dimuon data are thus at scales close to the charm mass. 
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Figure 2: Comparison of the ZM and I-ZM computations of the dimuon cross section 
(Pa'^'y^^''^ I dx dy for typical NuTeV kinematics. All cross sections in the plots are rescaled by a 
factor Ai, — 1Q0/G\MnEI. The neutrino kinematic parameters {E^,y) are related to x and 
by Eq. Results for anti- neutrinos are very similar. 



To this purpose, we employ (for the dimuon observable only) the improved ZM-VFN 
(I-ZM-VFN) scheme, proposed in Ref. [33] and discussed in detail in Ref. [34]. There, it 
was shown that the bulk of the charm mass effects near threshold can be accounted for by 
requiring that the threshold for the inclusion of heavy quarks in the sum over final states 
be set at its physical value W"^ = m^, and that the phase-space constraint due to the heavy 
quark mass be respected in convolution integrals. The latter requirement is in practice 
implemented by replacing the Bjorken x variable by a rescaling variable Xc defined as 

Xe^x(l + ^). (13) 

Whereas results obtained with this I-ZM-VFN scheme are in fair agreement with those 
obtained with a full treatment of the quark mass (so-called general mass, or GM scheme), 
and in rather better agreement with the data, they may lead to an excessive suppression of 
heavy quark production: to this purpose, in Ref. [34] a one-parameter family of rescaling 
variables has been constructed, such that the agreement with the GM scheme can be op- 
timized by tuning this parameter. It turns out, however, that the simplest choice Eq. p3p 
is actually very close to the optimal one for charged current deep inelastic scattering. 

Hence, in the present analysis we will use the ZM-VFN for all inclusive observables, 
but for the dimuon cross section Eq. ([3]) we will use the I-ZM-VFN of Ref. [33]. In practice. 
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CHORUS data 



NuTeV dimuon data 




Figure 3: Comparison of the nuclear corrections to the reduced cross sections for inclusive CHO- 
RUS data (left) and for NuTeV charm production data (right) from the de Florian-Sassot [35] 
and HKN07 [36]parametrization. The correction is shown for individual experimental data points, 
versus the point label (arbitrary order). 



this means that we will retain the full rric dependence in Eq. ([T]), and in the expressions 
for the structure functions F^''^ Eq. (j39[) all convolutions are defined as 

[C q] (x, Q') =e{W'- ml) f [y, a. (Q^)) q (^,qA . (14) 

The impact of this treatment of the charm mass is shown in Fig. [2l where we compare 
a NLO determination of the dimuon cross section Eq. ([3]) within the ZM-VFN and I-ZM- 
VFN schemes, based on our previous NNPDFl.O parton set. The suppression of the cross 
section at small x due to finite quark mass is apparent from this plot. Clearly, the inclusion 
of quark mass effects only in the determination of the dimuon cross section, and then in 
the I-ZM-VFN scheme, is an approximation. This approximation will lead to a systematic 
uncertainty in our determination of the strange PDFs and of CKM matrix elements in 
the next sections. We will estimate this uncertainty by comparing results obtained in the 
ZM-VFN and I-ZM-VFN scheme: as the full GM scheme is actually in between these two, 
this provides a rather conservative overestimate of the associated uncertainty. We will 
then see that this systematic uncertainty is actually small in comparison to the statistical 
uncertainty on strangeness and associated observables. 



3.4 Nuclear Corrections 

Neutrino data are obtained from deep-inelastic scattering off a nuclear target: for NuTeV 
essentially Fe, ^NuTeV = 49.6 [25], and for CHORUS (whose inclusive structure function 
measurements are also included in our data set) Pb, ^chorus = 207, [7]. Therefore, a 
suitable nuclear correction should be introduced in order to obtain from these data a 
determination of the PDFs of free nucleons. 

Nuclear corrections have been determined by various groups [35-38], using models of 
nuclear structure. The correction 
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CHORUS inclusive I NuTeV diamn 




Figure 4: Nuclear corrections to the neutrino structure function F2 for inclusive CHORUS data 
(left) and for NuTeV charm production data (right) from the Kulagin-Petti parametrization [37]. 
The correction is shown in the full kinematic region relevant for both experiments. 

to the reduced cross sections o"'^^^^ and cj^^'^^''^, obtained using the parametrizations of 
Refs. [35,36], are displayed in Figs. S] for the experimental CHORUS inclusive and dimuon 
NuTeV data. It is apparent that corrections obtained using different models can be signif- 
icantly different, but they are all quite small. For this reason, nuclear corrections were not 
used in the NNPDFl.O fit [21]. In the NNPDF1.2 fit presented here we will not include 
nuclear corrections in our baseline fit, but, in order to determine the associated systematic 
uncertainty, we will repeat the fit with the nuclear corrections computed using the models 
of Refs. [35,36], which provide corrections to the parton distributions. The dependence of 
the nuclear correction on the kinematic variables is shown in Fig. [J] in the kinematic region 
and for A values relevant for CHORUS and NuTeV data, using the model of Ref. [37], 
which instead provides directly a correction to the structure function. 
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Armax 




^ * cop 


^scts 


-^update 


[10,1] 


[10,1] 


[1,0.1] 


[1,0.1] 


[1,0.1] 


[5,0.5] 


[1,0.1] 


5000 


1/3 


120 


3 


10 



Table 4: Parameters controlling the genetic algorithm minimization. Since we work with 
-^mut = 2 there are two entries in each column for the values of r]^^\ 

4 Results 

In this section we present the NNPDF1.2 parton set. After discussing the general features 
of the fit and its result, and comparing these to the previous NNPDF parton set, we 
discuss in detail the determination of the strange and antistrange distributions, which are 
the novel features of this fit. We finally compare results to experimental data, including a 
comparison with the recent [26] determination of the structure function. 

4.1 The NNPDF1.2 parton set: statistical features 

We have produced a set of A'rep = 1000 replicas of seven PDFs, each determined as an 
optimal fit to one of the Monte Carlo replicas obtained from the data set of Sect. [21 
We have used the genetic algorithm minimization and a cross-validation method for the 
determination of the optimal fit, according to the method presented in Sect. 4 of Ref. [21]. 
The parameters of the genetic algorithm are summarized in Table [H they coincide with 
those used in Ref. [21] for the five PDFs already present in that fit. 

The general statistical features of our final parton set are summarized in Tables [SMl to 
be compared with the corresponding tables (Tables 7-8) of Ref. [21], where all the relevant 
quantities are defined (note that average uncertainties are now given in percentage value, 
while they were given as absolute values in Ref. [21]). 

The statistical features of the fit can be summarized as follows: 

• The general features of the total fit (Tab. [5|) are essentially indistinguishable from 
those of Ref. [21], and the comments we made then still apply. The same is true for 
the features of the fit to individual experiments (Tab. [6]) when these were already 
included in the dataset of Ref. [21]. This stability upon the addition of two new 
independent PDFs (thus 74 extra free parameters) and a randomization of the pre- 
processing exponents supports the reliability of the results obtained in NNPDFl.O 
for all PDFs which were determined there. 

• The quality of the fit to the new HERA II data is comparable to that to the BCDMS 
data, and somewhat worse than that of the fit to other HERA data. These new data 
mostly probe the large x region, like BCDMS and unlike other HERA data (see 
Fig. [T]), and are generally rather precise, also like BCDMS and unlike other HERA 
data (see Tab. [T]and Tab. 1 of Ref. [21]). This somewhat larger value of the for 
large x high precision data, though compatible with statistical fluctuations and with 
the theoretical error related to the use of NLO perturbation theory, may suggest 
some minor data incompatibility in this region. 

• The of the fit to dimuon data is rather smaller than one. This is a consequence 
of the fact that, as discussed in Sec. 12.11 correlations have not been included for 
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Xtot 


1.31 


{E) 


2.80 


(Etr) 


2.75 




9 fin 


(TL) 


1024 




11.0% 
4.0% 




0.15 
0.32 


(cov(-*)),^. 


1.6 10-^ 


6.1 10^3 



Table 5: Statistical estimators for the final PDF set with A^iop = 1000 for the total data set. 
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Table 6: Statistical estimators for the final PDF set with A'rcp = 1000 for individual experiments. 

these data because the covariance matrix is not available. The average value of the 
we obtain is in good agreement with that expected on the basis of the "effective 
number of degrees of freedom" published in Ref. [25], and with other fits to the same 
data [14]. 

• The uncertainty of the fit to dimuon data, as measured by the average standard 
deviation (a) is very close to the uncertainty of the data, unlike that of all other 
data sets (reflected by the results for the total fit), where the fit uncertainty is 
much smaller than the data uncertainty (4% vs. 11% for the total fit). This is a 
consequence of the fact that dimuon data have little redundancy, and are sensitive 
to strangeness, to which other data are essentially insensitive; while all other data 
have a very large redundancy, especially low-x HERA data which depend mainly 
on the quark singlet and gluon. This effect can also be observed in the comparison 
between experimental data and NNPDF1.2 predictions of Fig. [TSl 

• The average correlation is very low for the dimuon data, because the only correlated 
systematics is normalization. However, the fit to these data does display a corre- 
lation of the same order of magnitude as for other data, reflecting the underlying 
smoothness of parton distributions. 
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Figure 5: The singlet and gluon PDF at the starting scale Q^ = 2 GeV^, plotted versus a: on a log 
(left) or hnear (right) scale. The PDFs from the previous sets NNPDFl.O [21] and NNPDFl.l [23] 
are also shown for comparison. Note that while the PDFs from NNPDF1.2 and NNPDFl.O have 
been computed with TV^p = 1000, those of NNPDFl.l use TV^p = 100 only. 
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Figure 6: Same as Fig.[5l but for the valence and nonsinglet PDFs. 
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4.2 The NNPDF1.2 parton set: parton distributions 

The NNPDF1.2 set of parton distributions at the starting scale Qq = 2 GeV^ is displayed 
in Figs.lMZl and compared to the previous sets NNPDFl.O [21] and NNPDFl.l [23]. The 
distances (defined as in Ref. [21]) between each pair of these three sets are shown in 
Tabled 

The general features of this PDF set and its comparison to the previous NNPDF sets 
are the following 

• In the singlet sector, there is very little difference in central values and uncertainties 
between the NNPDF1.2 and NNPDFl.O parton sets: the distance between the sets is 
compatible with statistical fluctuations. The NNPDFl.l, which had an independent 
parametrization for the strange distribution without any data to constrain it displays 
an increase in the uncertainty of the quark singlet due to this unconstrained strange 
contribution. 

• The isospin triplet and the sea asymmetry are the same in all NNPDF sets within 
fluctuations. The total valence has the same central value in all sets within fluctu- 
ations, and the same uncertainty in the NNPDF1.2 and NNPDFl.l sets, while the 
uncertainty on it was somewhat underestimated in NNPDFl.O. This underestimate 
of the NNPDFl.O valence uncertainty was already singled out based on a statisti- 
cal stability analysis in Sect. 5.4 of Ref. [21], where it was suggested that it could 
be cured by a randomization of the preprocessing exponents in Eq. ([4]). This ran- 
domization has been implemented in NNPDFl.l and NNPDF1.2, which indeed have 
somewhat larger valence uncertainty, compatible with each other. This is despite the 
fact that the strange contribution to the total valence is affected by a much larger 
uncertainty in NNPDFl.l than in NNPDF1.2. 

• The central value and uncertainty on the strange distributions Eq. ([5]) are compat- 
ible with those of NNPDFl.l, where strangeness was independently parametrized 
but essentially unconstrained by data, whereas they are incompatible with those 
of NNPDFl.O, where strangeness was determined by the assumptions s^{x,Qq) = 
^ [u + (T) and s~{x, Qq) = 0. This means that this simple assumption, though per- 
haps not too far off, is insufficient to determine the strange distribution within its 
stated accuracy. This conclusion was also reached recently in Ref. [13]. The un- 
certainty on strangeness as we determine it here turns out to be rather larger than 
that induced by the NNPDFl.O assumption, but much smaller than that obtained in 
NNPDFl.l in the absence of dimuon data. It is thus possible to determine the shape 
of s"*" with reasonable accuracy. However, our determination of s~ turns out to be 
compatible with the NNPDFl.O assumption that s~{x,Qq) = 0. We shall discuss 
the features of the strange distribution in greater detail in Sect. 14.41 below. 

4.3 Theoretical uncertainties 

As discussed in Sects. 13. 3113. 4( dimuon data are potentially sensitive to the treatment of the 
quark mass, and neutrino data in general are potentially sensitive to nuclear corrections. 
In order to explore this sensitivity, we have repeated the NNPDF1.2 fit using also for 
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Table 7: Distance between the NNPDFl.O, NNPDFl.l and NNPDF1.2 parton sets. All distances 
are computed from a set of A^i-cp = 100 replicas. 

dimuon data the ZM-VFN scheme (as in Ref. [21]) instead of the improved I-ZM-VFN 
quark mass treatment discussed in Sect. 13.31 and used for the default NNPDF1.2 fit (the 
ZM-VFN is used for all other data anyway). The distances between results thus obtained 
are displayed in Tab.El It is apparent that there is a certain change in the central value of 
the strange distribution in the region of the data, of order of about ten, which, with 100 
replicas, means that the central value has moved by about lAa in units of the standard 
deviation. The uncertainty on itself, and the central value of the singlet distribution in 
the region of the data are affected to a lesser extent, while all other PDFs are unaffected. 
Thus the charm mass corrections displayed in Fig. [2] have a small but noticeable effect 
on the determination of the total strange distribution. Our approximate treatment 
will correspondingly be a source of systematics, which we shall take into account when 
discussing quantities related to strangeness. 

In order to study the sensitivity to the nuclear corrections displayed in Fig. [3] we have 
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Table 8: Distances between PDFs computed from a set of A^rcp — 100 replicas from the default 
NNPDF1.2 set, and 100 replicas obtained using a ZM-VFN scheme instead of the defauh I-ZM-VFN 
scheme of Sect. 13.31 or introducing nuclear corrections computed using the de Florian-Sassot [35] 
and HKN07 [36] models. 

repeated the NNPDF1.2 fit with all neutrino data corrected for nuclear effects according 
to the models of de Florian-Sassot [35] and HKN07 [36]. The distances tabulated in 
Tab. [8] show that the effect of nuclear corrections is negligible: fits with or without nuclear 
corrections differ by an amount which is compatible with statistical fluctuations. 

4.4 Determination of the strange distribution 

The determination of the strange and antistrange PDFs is problematic because of the 
scarceness of the experimental information on these quantities, which makes it difficult to 
separate the genuine information from theoretical bias, a situation which our methodology 
is especially suited to deal with. In previous parton fits, a range of possible shapes of 
the strange PDFs was explored by assuming different functional forms and studying the 
variation of results [13]. 

The s^{x^ Qq), s{x^ Qq) and s{x^ Qg) strange PDFs Eq. ([5]) are shown at the input scale 
in Fig.[8l where they are also compared to the most recent CTEQ6.6 [17] and MSTW08 [15] 
sets. Whereas the CTEQ collaboration has not performed a full determination of the s~ 
uncertainty band, a study of the dependence of the best-fit s~ on assumptions on its 
functional form was performed in Ref. [13]: several of the corresponding results are also 
shown in Fig. [8l For greater clarity, in Fig. [9]we also plot the uncertainties on these PDFs. 

In the data region x > 0.03 all determinations of agree, however the NNPDF1.2 
has a much larger uncertainty than other existing determinations. The origin of this can 
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Fi gure 8; From top to bottom, the strange C-eveii and C-odd combinations Qq)^ s (x, Qq) 

Eq. (O and the corresponding strange s{x, Qq) and antistrange s{x, Qo) PDFs, plotted at the input 
scale versus x on a log (left) or linear (right) scale, computed from the final set of -/Vj-ep = 1000 
replicas. The NNPDF1.2 result is compared to the MSTW08 [15] and CTEQ6.6 [17] global fits. 
For s~ some of the results obtained from the CTEQ6.5s strangeness series [13] are also shown. 




Figure 9: The uncertainty on tlie strange PDFs s^{x,Qq) shown in Fig. [H All bands correspond 
to one a. The relative uncertainty is shown for (left) and the absolute uncertainty for (right). 

be understood by looking at Fig. [TOl where we display 25 randomly chosen replicas out 
of our full set, and the mean and standard deviation computed from them: clearly, our 
large uncertainty is a consequence of the great flexibility afforded by the neural network 
parametrization. This is particularly noticeable in the case of s~ , which must have at least 
one node because of the sum rule Eq. ([9]): individual replicas cross the x-axis in different 
places, with different sign (from positive to negative or conversely), and some replicas have 
more than one crossing. It is interesting to observe that the "neck" in the uncertainty on 
s~ around x ~ 0.1 corresponds to the value of x at which the crossing is most likely to 
occur. The role played by the valence sum rule Eq. ([9]) in determining these features of 
the strangeness asymmetry s~ can be elucidated by repeating the fit without imposing it. 
The results, displayed in Fig. \TT\ show that even without the sum rule constraint many 
replicas still cross the the x-axis. 

Another theoretical constraint which may help in reducing uncertainties is that of 
positivity of cross sections. For instance, as in Ref. [21], in the determination of the 
NNPDF1.2 PDF set we have imposed positivity of the structure function at low x and 
Q^, which helps in reducing the uncertainty of the gluon distribution at the edge of the 
HERA data region. In view of the fact that (see Fig. [8]) both s{x,Ql) and s{x,Q'l) can 
turn negative to within one sigma for x < 10~^, and also in the large x > 0.2 region, one 
may wonder whether imposing positivity of the dimuon cross section might likewise help in 
reducing the uncertainty on the strange and antistrange distributions. In order to test this, 
in Fig. [T2]we display the total dimuon cross section, both at the initial = Qq = 2 GeV^ 
and at the typical scale of the NuTeV data = 20 GeV^, computed using the NNPDF1.2 
PDFs of Fig. [HI The cross section only becomes significantly negative at low and very 
low X < 10~^. For antineutrinos, it also become somewhat negative at large x: at the 
scale of the large x data > 20 GeV^ for x > 0.3. We conclude that the constraint of 
positivity only affects physical observables quite far from the data region. We have thus 
not imposed this constraint in the current fit. It might be worth implementing it in future 
fits which include Drell-Yan data, as these could further constrain strangeness, especially 
at large x. 

Further constraints could be based on theoretical expectations: for example, one may 
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Figure 10: A set of randomly chosen Ny^p — 25 replicas of the strange PDFs s~^{x, Qq), s {x, Qq) 
out of the full set of Fig. [51 and the PDFs computed from them. 



expect the strange PDF to be smaller than the light quark valence PDFs; indeed, the 
systematic implementation of theoretical or model constraints in parton fits has been 
advocated e.g. in Ref. [39]. However, expectations based on models of the nucleon have 
often turned out to be in disagreement with experiment: for instance, in the polarized 
case the strange distribution turns out to be unexpectedly large and in fact larger than 
the up distribution (see e.g. Ref. [40]). To obtain reliable phenomenology, such as the 
determination of electroweak parameters to be discussed below, we prefer therefore to only 
rely on exact constraints, such as the valence sum rule or positivity. 

The features of the strange distributions which are most interesting for physics ap- 
plications (as we shall discuss in more detail in Section [5]) are the momentum fractions, 
defined as ^ 

[S^] [Q^) = I dxxs^{x,Q^) , (16) 
Jo 

with similar definitions for moments of other PDF combinations, and in particular their 
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Analysis 


Reference 


Ks (Q^ = 20 GeV^) 


NNPDF1.2 
MSTW08 
CTEQ66 
AKP08 


This work 
[15] 
[17] 
[14] 


0.56 ± 0.03 
0.72 ±0.05 
0.59 ± 0.08 



Table 9: The relative strange momentum fraction Ks{Q'^) Eq. Ill7p . as determined from various 
parton sets. All uncertainties correspond to 68% confidence levels. 



ratio to the light sea or respectively light valence momentum fractions: 

K (Q') ^ Iidxxs+ {x,Q') ^ [S+] 

J^dx x{uix,Q^) + dix,Q^)) [U + D]' 

, [} dxxs~ (xjO"^) \S~] , , 

RsiQ^) = 2—. ^— = 2- — ^ — ^ — - . 18 

fQdxx{u~{x,Q^)+d-{x,Q^)) [U- + D-] 

In many parton fits, including the NNPDFl.O fit, these quantities are taken to be fixed at 
the starting scale: the value of the relative total strange momentum (sometimes also called 
strange suppression) is, since the earliest measurements, taken to be [41] Ks{Qq) ~ 0.5, 
while the strange asymmetry is assumed to vanish, i.e. Rs{Qq) = 0. 

The value and uncertainty on these quantities can be determined from the NNPDF1.2 
set by performing averages over replica PDFs [21], which for Ks and Rs will not necessarily 
coincide with the ratio of average PDFs, because Eqs. (jl7til8p are not linear in the PDFs. 
In fact, because the denominator in Eq. (jl7p can assume rather small values, we expect 
that the distribution of values of the total strange fraction Ks can be rather asymmetric 
and non-gaussian. The probability distribution of Ks at = 20 GeV^ is shown in 
Fig- fT3l and turns out to be indeed quite far from gaussian. Therefore, we compute the 
one-cr uncertainty as a central 68% confidence integral, namely requiring the two outer tails 
of the probability distribution (lighter blue region in Fig. [T^ to each correspond to 16% 
probability, with the central value still given by the average. The result we thus obtain for 
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NNPDF1 .2, - 20 GeV^. y = 0.4 



NNPDF1 .2, = 20 GeV^, y - 0.4 



Neutrino - central value ■ 
Neutrino - 1-ct range 
AntiNeutrino - central value ■ 
AntiNeutrino - 1-a range 




Neutrino - central value ■ 
Neutrino - l-o range 
AntiNeutrino - central value ■ 
AntiNeutrino - l-o range 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 



NNPDF1 .2, 0^ = 2 GeV^, y - 0.4 



NNPDF1 .2, - 2 GeV^, y = 0.4 



Neutrino - central value ■ 
Neutrino - 1-a range 
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AntiNeutrino - 1-a range 
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Neutrino - l-o range 
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Figure 12: The total neutrino and antineutrino dimuon cross sections at the starting scale Qq 



2 GeV^ (lower row) and at the "NuTeV" scale Ql 
log (left) or hnear (right) scale. 



20 GeV (upper row), plotted versus x on a 
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NNPDF1 .2, Q2 = 20 GeV^, N = 1 000 

rep 




0.5 1 _ _ 1.5 

Ks = [ / [ U + D ] 



Figure 13: Probability distribution of Ks at — 20 GeV^ computed from the reference set of 
-^rop = 1000 NNPDF1.2 PDF replicas. The central cross-hatched region corresponds to the central 
68% confidence interval, Kg {Q^ = 20GeV2) = O.TllEJ J?'''^*. 



the expected Ks and its uncertainty are shown in Table [9l along with the results found 
using other parton sets. The median of the probability distribution is equal to Kg^'^'^ = 
0.59, significantly different from the average because of the asymmetry. The NNPDF1.2 
uncertainty is much larger than that found in other fits, for the reasons discussed above. 
Note that, however, all values are essentially consistent with the simple assumption Ks = 
0.5 used in older parton fits. 

In the case of the strange momentum asymmetry Rs Eq. (jlSp the denominator is 
fixed by knowledge of the valence content of the nucleon, which is known quite accurately: 
hence we expect the uncertainty to be symmetric and dominated by uncertainty of the 
numerator. Indeed, the probability distribution for Rs, shown in Fig. [T4l turns out to be 
approximately gaussian so that the uncertainty computed from the central 68% confidence 
essentially coincides with the standard deviation of the distribution, while central value 
and uncertainty for Rs are essentially proportional to those of the strangeness asymmetry 
[S~]. This latter quantity has been determined by various groups, at various scales: several 
of these results are collected in Table fTOl and compared to our own. Results are given both 
at the scale at which they were determined, and then also when evolved to a common scale, 
exploiting the fact that at NLO (though not at NNLO [43]) [S~] evolves multiplicatively. 
In this case, too, the NNPDF1.2 uncertainty is much larger than that obtained in other 
fits: while for all other groups there is an indication that a positive value of [S~] is favored 
(all results being nevertheless compatible with zero), this indication looses its significance 
in our analysis due to the very large uncertainty. 

In the next section we will see that, surprisingly, even with such large uncertainties it 



29 



NNPDF1 .2, Q2 = 20 GeV^ N =1000 




Rg = 2 [ S" ] / [ U + D" ] 



Figure 14: Probability distribution of i?5 at = 20 GeV^ computed from the reference set of 
-^rcp — 1000 NNPDF1.2 PDF replicas. The central cross-hatched region corresponds to the central 
68% confidence interval, Rs {Q^ = 20GeV^) = 0.006 ± 0.045"'^*. 



Analysis 


Reference 


[S-] (Q") • 10^ 


[GeV] 


[S-] (Q^^^f = 20 GeV^) • 10^ 


NNPDF1.2 


This work 


0.5 ±8.6 


20 


0.5 ±8.6 


MSTW08 


[15] 


2.4 ±2.0 


1 


1.7 ± 1.4 


CTEQ6.5S 


[13] 


2.0 ± 1.8 


1.69 


1.6 ± 1.4 


CTEQ6.1S 


[11] 


1.5 ± 1.5 


1.69 


1.2 ± 1.2 


AKP08 


[14] 


1.0 ± 1.3 


20 


1.0 ± 1.3 


NuTeV07 


[10] 


2.2 ± 1.3 


16 


2.2 ± 1.3 


BPPZ03 


[42] 


1.8 ±3.8 


20 


1.8 ±3.8 



Table 10: Comparison of various determinations of strangeness momentum asymmetry [S~] (Q^) 
Eq. All uncertainties correspond to 68% confidence levels. Both the published value is given, 

and the value obtained evolving to Q^.^^ = 20 GeV^ through NLO perturbative evolution. 
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Kg (mean) 




Reference 


'-'•'-'^-0.31 


(6 ± 45) • 10-3 


ZM-VFN 
Nuclear - dFS03 
Nuclear - HKN07 
LO 


-0.20 

0.681°!^ 

'-'•"-'^-0.22 


(8 ± 39) • 10-3 
(12 ± 48) • 10-3 
(0 ± 40) • 10-3 
(1±38) • 10-3 


No strange SR 


U.DZ_Q 21 


(17 ±32) • 10-3 



Table 11: The strange relative total and valence momentum fractions Ks and Rs, Eqs. (jlTllSp . 
at the scale = 20 GeV^. The first row gives the value computed from the reference NNPDF1.2 
set of Nrcp — 1000 replicas, while the other rows give results from sets of A^rep — 100 replicas each 
obtained from alternative fits discussed in the text. All uncertainties are one-cr or 68% central 
confidence intervals. 

is possible to exploit our determination of Ks and Rs for a determination of electroweak 
parameters. In view of this, it is useful to also study possible sources of systematic uncer- 
tainty on these quantities. Possible significant sources of systematics are the following: 

• Heavy quark mass effects. The treatment of heavy quark mass effects entails various 
ambiguities related to the prescription used to deal with subleading terms [33] . In our 
case, a further source of systematics is due to the fact that the charm quark mass 
is treated approximately, using the I-ZM-VFN scheme as discussed in Sect. 13.31 
and then only for dimuon data. The corresponding uncertainty is conservatively 
estimated by repeating the fit in a pure ZM-VFN scheme. 

• Nuclear corrections. Their effect is estimated by repeating the fit with CHORUS 
and NuTeV data corrected using the de Florian-Sassot [35] and HKN07 [36] models. 

• Higher order QCD corrections. These are very conservatively estimated by repeating 
the fit at LO. 

The results from Ks and Rs obtained in each of these cases are compared in Table [TT] 
to the reference NNPDF1.2 result, all at the scale = 20 GeV^. It is apparent that the 
effect of any of these systematics is rather moderate, even if very conservatively estimated. 
In the same table we also show the result of a fit in which the sum rule Eq. ([9|) is not 
imposed: even in this case the result changes very little. 

Estimating the effect of the systematics from the sum in quadrature of the shift of 
central values due to the four central rows of Table E] we get, at = 20 GeV^ 

Ks = 0.7ll°:^f*''*±0.26^y^*, (19) 
Rs = 0.006 ±0.045"*''* ±0.010'^'*. (20) 

The systematics on Rs is thus negligible, and mostly due to nuclear effects. The system- 
atics on Ks is not quite negligible, and almost entirely due to the treatment of the heavy 
quark mass: this is an aspect of our analysis which could be improved in the future within 
a more accurate treatment of quark mass effects. 
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Figure 15: Comparison between the NuTeV data and the NNPDF1.2 theoretical predictions for 
neutrino (upper three rows) and anti-neutrino (lower three rows) dimuon production. All cross 
section in the plots are rescaled by a factor Ajj — ' neutrino kinematics parameters 

{Ey^y) are related to x and by Eq. The solid line is the central NNPDF1.2 prediction and 
the dashed lines the l-cr interval. 
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Figure 16: Comparison witli tiie experimental determination [26] of the interference structure 
function xF^^{x, Q'^)a,t = 5000 GeV^ with the NLO prediction obtained using the NNPDFl.O 
and NNPDF1.2 parton sets. 



4.5 Comparison with experimental data 

The results obtained from a NLO computation of the dimuon cross section from the 
reference set of NNPDF1.2 parton distributions with A'rep = 1000 replicas are compared 
in Fig. [15] to the NuTeV experimental data. The agreement is clearly excellent in all bins 
except for the lowest values (bottom left plot), where the approximate treatment of 
the quark mass leads to a deterioration in quality of the fit. 

In Ref . [26] , an extraction of the interference parity-violating structure function xF^^ 
evolved to a common scale Q2 = 5000 GeV^ was also presented. This extraction is based 
on data already included in our fit, so these data do not provide any extra information. 
However, it is interesting to compare directly to it, because this structure function is 
directly sensitive to the flavour and valence/sea decomposition of PDFs (specifically to 
strangeness), which is difficult to probe directly (see e.g. Ref. [21], appendix A. 2). Because 
the contribution of xF^^ to the total reduced cross-section is small and only relevant in 
a limited region of phase space, the agreement between data and theory for this quantity 
could in principle be poor without this being significantly reflected in the quality of the 
global fit. 

A comparison of these data with the NLO prediction obtained using the NNPDFl.O 
and NNPDF1.2 parton sets is shown in Fig. [161 ^^'^ shows good agreement: = 1-53 for 
NNPDF1.2, and = 1-55 for NNPDFl.O, comparable to the value for other data in the 
valence region (despite the fact that for the NNPDFl.O fit neither the data of Ref. [26] 
nor dimuon data were used). The widening of the uncertainty band when going from 
NNPDFl.O to NNPDF1.2 is a consequence of the sensitivity of this structure function to 
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a{W+)Br {W+ l+ui) 


a{W-)Br (W- l+ui) 


a(Z")Br (Z" ^ 1+1-) 


NNPDF 1.0 


10 TeV 
14 TeV 


8.49 ± 0.18 
11.83 ± 0.26 


5.81 ± 0.13 
8.41 ± 0.20 


1.36 ± 0.02 
1.95 ± 0.04 


NNPDF 1.1 


10 TeV 
14 TeV 


8.52 ± 0.33 
11.86 ± 0.46 


5.79 ± 0.28 
8.38 ± 0.39 


1.36 ± 0.04 
1.95 ± 0.06 


NNPDF 1.2 


10 TeV 
14 TeV 


8.61 ± 0.25 
11.99 ± 0.34 


5.85 ± 0.15 
8.47 ± 0.21 


1.37 ± 0.03 
1.97 ± 0.04 



Table 12: Cross sections for gauge boson production at the LHC. All quantities have been com- 
puted at NLO using MCFM [44-47] and NNPDF partons. 

valence combinations, and strangeness in particular: very precise measurements of it could 
greatly improve flavour separation of PDFs. 

A detailed study of the phenomenological implications of our reassessment of the 
strangeness uncertainty for LHC observables is beyond the scope of this work. However, 
in Table [T2] we collect the total cross section for W and Z production computed at NLO 
with MCFM [44-47]: results obtained with the NNPDF1.2 and NNPDFl.l parton sets 
are compared to those found using NNPDFl.O. already discussed in Ref. [21]. Because of 
the increased uncertainty on the strange distribution, the uncertainty in the cross section 
is larger in NNPDFl.l and NNPDF1.2, though less so in NNPDF1.2 due to the constraint 
from dimuon data. 
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5 Precision determination of electroweak parameters 



Neutrino DIS data, and especially dimuon data, can be used to perform direct mea- 
surements of electroweak parameters [48,49]. However the potential precision of these 
measurements can be spoiled by PDF uncertainties. Indeed, we have seen in Sect. |3]that 
the uncertainties we obtain on the strange distributions are quite large, typically larger 
by almost one order of magnitude than those found in previous global fits. 

The CKM matrix elements control the strength of the coupling of various partons to 
neutrinos according to Eqs. (jllll2p . In spite of the large PDF uncertainties in the strange 
sector, we shall provide here the most precise direct determination up to date of the CKM 
matrix element \Vcs\ within a single experiment. We will also provide a determination 
of \Vcd\ with an accuracy consistent with previous results from neutrino data. These 
remarkable results are possible because PDF uncertainties are free from parametrization 
bias, thus they may be disentangled from the uncertainty on the physical parameters. 

We will then turn to a study of the impact of PDF uncertainties on the extraction of 
the electroweak mixing angle sin^ from the Paschos-Wolfenstein ratio: we will show 
that once PDF uncertainties are properly taken into account, the NuTeV measurement of 
this ratio [1] is in full agreement with the standard model prediction. 

5.1 Determination of \Vcs\ and \Vcd\- 

Since the pioneering CDHS studies [41] , neutrino DIS has been used as a means to directly 
determine CKM matrix elements: the parton-model expressions for the neutrino and 
anti- neutrino dimuon production Eqs. (|ll|12p provide two equations which relate two 
experimentally measurable cross sections to the two unknowns \Vcd\ and \Vcs\- 

However, these equations also contain as unknowns the second moments of the light 
quark PDFs (the total cross section is proportional to the second moment of the PDF). The 
standard lore [24,31,41] is then that if one assumes that S~ ~ 0, the linear combination 
jpu,c _ pu,c Qj^^y (igpgnds on the \ Vcd\ and the u and d valence components, which are well 
measured by other experiments, so it can be used to determine \Vcd\- On the other hand, 
the orthogonal combination F!^''^ + F!^'"^ depends on the | Ksl/lKdj ratio, but also on Ks 
Eq. (fT7|) . and thus it can only be used to determine the combination iVcsj^s- Indeed, the 
PDG [24] quotes a value of \Vcd\ = 0.23 it 0.11 obtained from the average neutrino dimuon 
experiments as the best current direct determination. Only the bound \ Vcs\ > 0.74 at 90% 
confidence level [31] was quoted in previous PDG [50] editions, but this is now superseded 
by a direct determination \Vcs\ = 1-04 it 0.06 from D decays (for a recent update, see 
Ref. [51]). Of course, the values obtained from the current global CKM fits [24,52,53] are 
much more precise than these direct determinations (see Table O below). 

In the NNPDF1.2 reference fit, \Vcd\, \Vcs\-, and \Vcb\ are each fixed to the current PDG 
value [24], obtained from the global CKM unitarity fit. We now show that, thanks to the 
fact that we are free of bias related to the parametrization of strangeness, we can extract 
both \Vcs\ and \Vcd\ from the fit. In order to do this, we perform a scan over the values 
of \Vcs\ and \Vcd\ used in the fit, holding \Vcb\ fixed, but relaxing the unitarity constraint 
(in practice, because of its smallness, the precise value chosen for \ Vcb\ is inconsequential). 
The best-fit value and uncertainty for the CKM parameters are then determined in the 
standard way by maximum likelihood from the profile. 
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NNPDF1 .2, Nrep = 500, |Vcdl=0.2256 
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Figure 17: The of the NuTeV dimuon data as a function of |Vcs| when \ycd\ is kept fixed at its 
best unitarity fit value. The long-dashed curve is the parabolic fit from which the central value and 
one-cr uncertainty Eq. ((2T|) are obtained; the short-dashed curve is a parabolic fit to the central 
and two outer points only. 

The determined from a set of A'^dat data points fluctuates, with a standard deviation 
equal to = \/2iVd^. In order to determine the profile as the underlying parame- 
ters are varied, these fluctuations must be kept under control. Within our Monte Carlo 
approach, this could be done by using the same set of data replicas each time the is 
recomputed with different values of the underlying parameters. This might however bias 
the result in a random way depending on the particular set of replicas which has been cho- 
sen in the first place. We prefer thus to vary randomly the set of replicas which is used for 
different parameter values: fluctuations are then kept under control by using a sufficiently 
large set of replicas, given the fluctuation of the computed from a replica average has 
a standard deviation equal to j ^fW^. Because only dimuon data are sensitive to the 
CKM matrix elements, we can determine their values from the dependence of the x^ of 
the flt to these data only, rather than for that of the flt to the global dataset. Because 
we have (see Tab. [T]) 84 dimuon data points, a set of A'rep = 500 replicas is sufficient to 
guarantee that point-by-point fluctuations are smaller than Ax^ = 1- 

First, we vary independently each of the two CKM matrix elements, keeping the other 
fixed at its central value in the CKM unitarity fit. The x^ profile is computed for five 
equally spaced values of the parameter which is being varied. The values have been chosen 
on the basis of a preliminary exploration of the space of parameters based on fits with a 
small number of replicas; they are displayed in Fig. [T9j The ensuing x^ profile is displayed 
in Fig. [T7]for and in Fig. [18] for We observe well-defined minima in both cases. 
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Figure 18: The of the NuTeV dimuon data as a function of |Vcd| when |Vcs| is kept fixed at its 
best unitarity fit value. The log-dashed curve is the parabolic fit from which the central value and 
one-cr uncertainty Eq. ([^^ are obtained; the short-dashed curve is a parabolic fit to the central 
and two outer points only. 
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\Vcd\ 


\Vcs\ 


Statistical 


±0.012 


±0.05 


Mass effects 
Higher order QCD 
Nuclear corrections 


±0.007 
±0.010 
±0.008 


±0.02 
±0.03 
±0.03 


Total systematic uncertainty 


±0.014 


±0.05 


Total uncertainty 


±0.019 


±0.07 



Table 13: Summary of statistical and systematic uncertainties in the present determination of 
\V,s\ and \V,d\- 

A parabolic fit leads to 

\Vcs\ = 0.93 ±0.06, (21) 
\Vcd\ = 0.248 ±0.012, (22) 

where the one-o" uncertainty is obtained from the condition A^^ = 1 • The fit is quite stable 
upon the choice of different subsets of the five available points: if it is repeated by only 
retaining the central and two outer points neither the central values nor the uncertainties 
Eqs. ()2m22p vary significantly. This confirms that the number of replicas used to compute 
the is sufficiently large for the result not to be biased by statistical fluctuations. Both 
fits are shown in Figs. [T71I18I 

This shows that either CKM matrix element can be determined from our data, with 
comparable uncertainty, by taking the other fixed. We can thus perform a simultaneous 
determination of both these CKM matrix elements. In order to improve the accuracy 
of this determination, we compute the at four more points in the (|V"crf|, \Vcs\) plane, 
denoted by squares in Fig. [191 The iii these additional points is computed from a 
smaller set of A'rep = 100 replicas. The result of the combined fit is then 

\Vcs\ = 0.96 ±0.05, (23) 
\Vcd\ = 0.244 ±0.012. (24) 

The uncertainties turn out to be almost identical to the diagonal uncertainties, and the 
correlation coefficient is relatively small p = 0.21, reflected in a moderate shift in central 
values in comparison to the separate fits Eqs. (121ll22p . The location of the best-fit point 
and one-cr (A^^ = 1) ellipse in the (|V"crf|, \Vcs\) plane for the best-fit paraboloid is 
shown in Fig. [20l 

This determination Eq. (j24p is affected by the same systematics that we examined in 
Sect. 14.31 namely, higher order QCD corrections, treatment of heavy quark effects and 
modeling of nuclear corrections. In order to assess their impact in the CKM element 
determination, we have repeated the determination of each of the two parameters as the 
other is kept fixed, Eqs. (I21ll22p . by recomputing the for a smaller set of N^^^ = 100 
replicas along the points denoted as circles in Fig. 1191 with each of these three effects varied 
in turn as we did in Sect. 14.31 We then take the shift in central value as an estimate of the 
corresponding uncertainty. The results are summarized in Table [T3l Putting everything 
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Figure 19: The grid of points used in the determination of the CKM matrix elements \Vcs\ and 
\Vcd\- Open circles denote points used for the determination of \Vcd\ Eq. ([22|) . and full circles points 
used for the determination of \Vcs \ Eq. (pij) . All points are used in the joint determination Eq. f^^. 

together, we find 

\Vcs\ = 0.96 ±0.07*°* , (25) 
\Vcd\ = 0.244 ±0.019*°*. (26) 

In Table [T3] we compare our final results Eqs. (|25ll26p with the best CKM unitarity 
fit results and with other direct determinations. Our determination of \Vcd\ is consistent 
with other direct determinations, and of comparable accuracy, though one should bear 
in mind that previous determinations from dimuon data were based on fits with a fixed 
functional form, and thus subject to potentially large systematics bias. Our determination 
of \Vcs\ is rather more accurate that any other direct determination from dimuon data, 
more accurate than any single direct determination, and of comparable accuracy to the 
PDF average of determinations from D decays. 
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Figure 20: Location of the best- fit point and one-cr (statistical Ax^ — 1 uncertainty) ellipse in 
the (iVcdl, \Vcs\) plane for the best-fit paraboloid obtained from the computed at the points 
displayed in Fig. [TH The best unitarity fit result [24] is also shown for comparison. 
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Analysis 



Description 



Reference 



IK, 



0.96 ± 0.07*°* 

> 0.59 (90% C.L.) 

> 0.74 (90% C.L.) 

1.04 ±0.06 
1.04 ±0.16 
OMtoil ± 0.13 
0.97334 ± 0.00023 



NNPDF1.2 

CDHS 
CCFR 
PDG08 
Hockcr 
DELPHI 
PDG08 



Direct determination from global PDF analysis 
LO detcrniiuation from vN jj.'^ fi^ X 
NLO determination from vN —^ ^'^ n~ X 
Averages of determinations from D decays 
Averages of determinations from vN X 
Direct measurement from — > cs decays 
CKM unitaxity fit 



This work 

[41] 
[28,31] 
[24] 
54] 
55] 
24] 



Analysis 


Description 


Reference 


\Vcd\ 


NNPDF1.2 
CDHS 
CCFR 
PDG08 
PDG08 
PDG08 


Direct determination from global PDF analysis 
LO determination from vN — > /j,'^ n~ X 
NLO determination from vN — > ji^ ^jT X 
Averages of direct determinations from vN fj,'^ fj,~X 
Average of determinations from D K/nlv decays 
CKM unitarity fit 


This work 
[41] 
[31] 
[24] 
[24] 
[24] 


0.244 ±0.019*°* 

0.24 ±0.03 
P 909+0.017 

0.230 ±0.011 
0.218 ±0.023 
0.2256 ±0.0010 



Table 14: Comparison of the present determination of the CKM matrix elements |V"cs| (upper 
table) and \Vcd\ (lower table) with other available direct measurements, averages and CKM con- 
strained fits. 
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5.2 PDF corrections to the Paschos-Wolfenstein ratio 



The successful determination of the CKM matrix elements which control charged current 
scattering suggests that we might use our parton set for a reliable reassessment of the 
determination of the coupling which controls neutral current neutrino DIS. As is well 
known [56], this coupling depends on the electroweak mixing angle, which can thus be 
extracted from its experimental measurement. Specifically, in the parton model one has 

a{vM vX) - a{uM vX) 
^ a{vM ^ IX) - a{vM ^ IX) 

(27) 



^ - sin^ % + 



[2i 6 



where is the electroweak mixing angle, [S ] is the strange valence momentum fraction 
Eq. (jl6p . [D~] and [C^] the valence momentum fractions of other quark flavors, and 

[Q-] = ([[/-] + [D-])/2. 

The recent experimental determination [1] 



sin^ 6'w 



= 0.2277 ± 0.0014"*''* ± 0.0009'^' = 0.2277 ± 0.0017*°* , (28) 

NuTeV 



is obtained using Eq. ()27l) under the assumption that for an isoscalar nucleon target \U~]- 
[-D^] = [C^] = [S'^]=0, so the term in square brackets in Eq. (p7|) vanishes. Of course, the 
NuTeV iron target is not exactly isoscalar; however, the corresponding correction can be 
computed [1] with small uncertainty [2]. The result Eq. (j28p disagrees at the three-o" level 
with the value determined in global precision electroweak fits, such as [57,58] 



sin^ 



= 0.2223 ± 0.0003 . (29) 

EWfit 



Possible explanations for this include nuclear effects, electroweak corrections, QCD correc- 
tions, and physics beyond the standard model [2] (see e.g. [59] for an updated list of refer- 
ences). However, one may also [2] question the validity of the assumption of the vanishing 
of the contribution in square brackets in Eq. (j27p . The possibility that \U~] — [D~] ^ 
even for an isoscalar target due to isospin violation induced by QED evolution effects was 
discussed in Ref . [60] : it could easily explain about a third of the observed discrepancy. 

In our fit, isospin symmetry is assumed, and furthermore [C^] = 0. We are then left 
with the correction 

6s sin^ e^N = -Rs- (3 - 7sin2 9^^) , (30) 
6 

with Rs defined in Eq. ([TH]) . Using the value of Rs Eq. ([20^ . obtained at the typical scale 
= 20 GeV^ of the NuTeV data (and whose scale dependence is very small anyway [43]) 
we obtain 

5s sin^ % = -0.001 ± 0.011^°^' ± 0.002*^, (31) 

where the theoretical uncertainty comes from the effects discussed above in Sect. 14.31 and 
it is not to be confused with the experimental systematics in the NuTeV measurement 
Eq. 

Even neglecting these theoretical uncertainties (which we estimated very conserva- 
tively), the additional PDF uncertainty due to strangeness alone is thus about twice the 
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Determinations of tine weal< mixing angle sin^G^^ 





NuTeVOI NuTeVOt Global EW fit 
+ NNPDF1.2 [S"] 












: 




I 









Figure 21: Comparison between the NuTeV determination of sin^ (?w, Eq- ([28|) . the resuh from the 
global electroweak fit, Eq. (I^S)) . and the NuTeV result after the correction due to the uncertainty 
on S- Eq. 



observed discrepancy in sin^ ^w- We must conclude therefore that the apparent incon- 
sistency between the NuTeV measurement and the global electroweak fit disappears once 
the uncertainty on the strange distribution is properly taken into account. Applying the 
correction Eq. (|31[) the NuTeV result becomes 

= 0.2263 ± 0.0014'*^* ± 0.0009'^" ± 0.0107™^". (32) 

NuTcV 



sin^ 



We recommend that the corrected result Eq. ([32]) be used, for instance in global electroweak 
fits. This corrected result is compared graphically in Fig. [21] to the original NuTeV result 
Eq. (128p and the result from the global electroweak fit Eq. ([29p . 
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6 Conclusions and outlook 



We have presented an upgrade of the NNPDFl.O parton set, which now includes an inde- 
pendent parametrization for the strange distributions, and the inclusion of dimuon data 
which constrain them. Besides being an intermediate step towards a fully global fit includ- 
ing hadronic data, our results arc interesting as a test of the NNPDF methodology, as a 
state-of-the art determination of the strange PDFs, and as a determination of electroweak 
parameters. 

We have shown that the NNPDF approach has no difficulty in dealing with situations 
where experimental information is scarce and only provides loose constraints on the form 
of parton distributions. Within our approach, this does not require the introduction 
of theoretical assumptions or constraints in order to obtain stable results. We can thus 
provide reliable estimates of uncertainties, free of bias induced by theoretical assumptions. 

We have obtained a determination of the strange momentum fraction and of the 
strangeness valence component, which, though in agreement with previous determina- 
tions, turn out to be affected by uncertainties which are sizably larger than those found 
by other groups. 

Nevertheless, we have shown that, with the uncertainty on the strange PDF carefully 
estimated, the dimuon data can be used to provide a good determination of the CKM 
matrix elements IV^^I ^^'^ iKsl- In particular, our determination of \Vcs\ is the most 
accurate ever obtained from neutrino deep-inelastic scattering data, and it is also more 
accurate than any individual direct determination from D decays. We have also shown that 
once PDF uncertainties are estimated reliably, the value of the electroweak mixing angle 
extracted from NuTeV inclusive data is in agreement with standard model expectations. 

The main defect of our results is that they arc still based on an approximate treatment 
of the charm mass. Within the context of the present work, the only significant implication 
of this is a slight increase in the systematic uncertainty on our determination of \Vcd\- 
However, this also entails a further small but non-negligible systematic uncertainty in our 
determination of PDFs [21]. 

It will be interesting to study the implications for LHC obscrvables of this reassessment 
of the uncertainty on the strange distribution. The NNPDF1.2 release is available from 
the webpage of the NNPDF Collaboration http://sophia.ecm.ub.es/nnpdf/. 
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A Kernels for Physical Observables 

In this appendix we expand the physical observables for dimuon production in the evolution 
basis of the PDFs, and derive expressions for the kernels, in the same way and using the 
same notation as in Appendix A of Ref. [21]. All convolutions may be performed either 
in the ZM-VFNS or in the I-ZM-VFN scheme, as discussed in Sect. 13.31 

The cross-section for charm production in neutrino scattering off an isoscalar nucleon 
is given by Eq. ([1]), which we write as 

^u[u),c ^ ^^Y+F^i-),'^ - y^Ff^'''^ ± y_ xF^'^^^ (33) 

where 

- v, = fr,-HM|V_,Af:,;iV.^ (34) 



27r(l + g2/M2,)2' + V )\ Q\ 

Taking into account a possible non-isoscalar component of the nuclear target by defin- 
ing T = 1 — in the quark model we have 

pu,c ^ pu,c ^ ^pu,c ^ ^ ^\v,a\\{l + t)u + (1 - T)d) + 2\V,s\^s + 2\V,t\^b) , (35) 

F^'" = Fl'" = -X F^^' = X {\V,d\\{l + T)u + {l-T)d) + 2\V,s\^s+ 2|Kb|'6) (36) 

where all explicit dependence on x and has been dropped. In terms of the PDF 
evolution eigenstates we then have 

+ lws{Ts ± Vs) + ^Wi5{Tl5 ± Vi5) + ^W2a{T24 ± V24) + ^Wo{T35 ± ^35)},(37) 

where the +(-) sign corresponds to neutrino (anti-neutrino) scattering, and the CKM 
factors are 

Wo = \Vcd\'^ + \Vcs\'^ + \Vcb\'^ , W3 = \Vcd\'^, Ws = \Vcd\'^ - 2\Vcs\'^ , 

wi5 = \Vcd\^ + \Vcs\^ W2A = \Vcd\'' + \Vcs\^ -MVa\^. (38) 

Unitarity of the CKM matrix is imposed setting = Ij in the CKM determination in 
Sec. 5.2 it is however left unconstrained. Below b threshold Vcb = 0, so wo = wis = W2a- 

In perturbative QCD the charm production neutrino structure functions thus take the 
form 



Uf 



3) 



+ ^W8{T8 ± Vs) + ^Wi5{Ti5 ± Vis) + ^W2a{T2A ± V2a) + ^Wo{Ts5 ± ^3s)},(39) 



3) 

+ lws{Ts ± Vs) + ^Wi5{Ti5 ± Vis) + ^W2AiT24 ± V24) + ^WoiTs^ ± l^3s)},(40) 

where i = 2,L and is the number of active flavours. We can thus write the charm 
production neutrino cross-sections as 

a'''-'^'^ = Kx{Kf ^'' So + Kf^''' 0go± Kf ^'' Vo 

±K"(")'" (1^3^3,0 + ^WsVs,0 + l^u;i5^15,0)}, (41) 
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where in Mellin space the kernels are 

^.(p),c ^ (y^C2,,-y^CL,, + y_C3,,)r+g, 
^.(p),c ^ (y+C2,,-y^CL,, + y_C3,,)r^s. 
Below the h threshold the singlet kernels simplify to 

Kf^'' = lw,,{Y+Cl^-y'Cl^ + Y_Cljns- 
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